Speeding up keyword search for search engines
نویسندگان
چکیده
In the current information age, the dominant method for information search is by providing few keywords to a search engine. Keyword search is currently one of the most important operations in search engines and numerous other applications. In this paper we propose a new text indexing technique for improving the performance of keyword search. Our proposed technique not only speeds up searching operations but also the operations for inserting and for deleting keywords, which are particularly important for the ever increasing and dynamic changing databases such as that for search engines. We propose to partition all keywords into search trees based on the first character and the length of the keywords. Our partitioning scheme creates a much more even distribution of keywords and results in a 32% speedup in the worst cases and a 1% speedup in the average cases in comparing to one of the leading text indexing techniques called burst tries. In addition, our proposed technique stores document indexes only at the leaf nodes of the search trees and results in efficient algorithms for searching, insertion, and deletion of keywords. We successfully integrated the technique into our Information Classification and Search Engine system and showed its potential and feasibility.
منابع مشابه
Do Longer Queries Retrieve More Diverse Results?
In this paper, we demonstrate that verbose and grammatically complex queries retrieve, on average, more diverse results across different search engines than the short keyword queries. Our evaluation using both commercial and open source search engines shows that the overlap between search engine results decreases by up to 50% as a function of query length.
متن کاملAn Effective Path-aware Approach for Keyword Search over Data Graphs
Abstract—Keyword Search is known as a user-friendly alternative for structured languages to retrieve information from graph-structured data. Efficient retrieving of relevant answers to a keyword query and effective ranking of these answers according to their relevance are two main challenges in the keyword search over graph-structured data. In this paper, a novel scoring function is proposed, w...
متن کاملSearch Engines Going beyond Keyword Search: A Survey
In order to solve the problem of information overkill on the web or large domains, current information retrieval tools especially search engines need to be improved. Much more intelligence should be embedded to search tools to manage the search and filtering processes effectively and present relevant information. As the web swells with more and more data, the predominant way of sifting through ...
متن کاملKeyword stuffing and the big three search engines
Purpose – The purpose of this research project was to determine how the three biggest search engines interpret keyword stuffing as a negative design element. Design/methodology/approach – This research was based on triangulation between scholar reporting, search engine claims, SEO practitioners and empirical evidence on the interpretation of keyword stuffing. Five websites with varying keyword ...
متن کاملAn Efficient Approach for Keyword Selection; Improving Accessibility of Web Contents by General Search Engines
General search engines often provide low precise results even for detailed queries. So there is a vital need to elicit useful information like keywords for search engines to provide acceptable results for user’s search queries. Although many methods have been proposed to show how to extract keywords automatically, all attempt to get a better recall, precision and other criteria which describe h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004